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'v. 

1.  Introduction  x 

Remote  sensing  of  the  atmosphere  Is  a  rapidly  developing  science. 
Today's  meteorological  satellites  such  as  those  in  the  TIROS-N  series  have 
high  resolution  instruments  on  board  which  measure  the  intensity  of 
upwelling  radiation  in  selected  channel  frequencies.  A  description  of  the 
data  retrieved  by  the  radiometers  on  the  TIROS-N  type  satellites  can  be 
found  In  [7].  From  these  data  it  is  possible  to  obtain  Information  on  the 
atmosphere's  temperature,  moisture  and  wind  structure.  One  of  the  goals  of 
the  current  Satellite  Meteorology  program  Is  to  improve  the  quality  of 
atmospheric  information  obtained  from  satellite  soundings  to  a  point  where 
it  can  be  used  for  weather  forecasting  purposes.  A  major  challenge  in  this 
direction  is  to  develop  refined  numerical  and  statistical  methods  for 
Inverting  the  equations  of  radiative  transfer  given  a  finite  number  of 
noisy  measurements. 

For  a  non-scattering  atmosphere  In  local  thermodynamic  equilibrium  the 
radiative  transfer  equations  (RTE's)  describe  how  the  satellite  upwelling 
radiance  measurements  relate  to  the  underlying  temperature  distribution  T:- 

.  P°  d 

3vm  ■  BvCT(p0)]rv(p0)  -  /  Bv[Tlp)^v(p)<lp  (1.1) 

where  pQ  is  the  surface  pressure,  T^(p)  is  the  transmittance  of  the 
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Bv[T{p)]  =  c1v3/{exp(c2v/T(p))  -  1} 

cx  =  1.19061X10"5erg-cm2-sec-1  (1.2) 

c2  =  1.43868  cm-deg(K) 

The  R.T.E’s  are  of  course  an  idealization.  They  describe  the  inten¬ 
sities  the  satellite  radiometer  would  record  In  the  absence  of  such  things 
as  atmospheric  attenuation  due  to  clouds  or  Instrument  noise.  However, 
by  using  high  resolution  radiometers  like  the  HIRS  or  AVHRR,  sets  of 
Intensity  measurements  from  many  FOV's  (fields  of  vision)  can  be  combined 
to  obtain  data  of  the  form 

Z1  *  +  ei  *  c  1»***.n  (1.3) 

where  e^'s  are  errors.  These  data  relate  to  an  area  of  about  119  by  140  km 
on  the  earth's  surface.  See  [6]  for  more  details. 

We  are  Interested  In  refining  the  methods  used  to  obtain  temperature 
distribution  estimates  from  the  above  data.  The  procedure  currently  used 
to  process  TIROS-N  temperature  sounding  data  Is  a  linear  regression 
technique  see  [6].  Here  we  begin  to  discuss  how  the  method  of  regulariza¬ 
tion  (M.O.R. )  might  be  used  to  Improve  the  quality  of  temperature  profiles 
obtainable  by  this  procedure. 
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Let  T  be  the  true  temperature  profile  In  the  atmosphere.  Then  T  can  be 
written  as 

T  *  Tq  +  6  (1.4) 

where  TQ  Is  the  current  best  guess  of  T  and  4  Is  the  update  or  correction 
to  T0  to  be  estimated  from  the  data  {z^}  in  hand.  Using  M.O.R.  to  estimate 
6  involves  consideration  of  a  functional  1^  given  by 

pQ 

1,(4)  =  [  [zrl(V6)]2  +  X  /  [6(n,)(p)]2dp  (1.5) 

and  picking  the  estimated  update  <5^  to  minimize  this  functional*  over  some 
class  of  physically  plausible  candidates,  for  Instance  the  set  of  functions 
6  in  W2mtO,pQ]  for  which  TQ+6  is  positive  or  perhaps.  If  the  location 
of  the  temperature  inversion  were  reliably  known,  one  would  look  for  mini¬ 
mi  zers  of  IA  subject  to  an  additional  constraint  involving  temperature 
inversion. 

The  statistical  reasoning  for  considering  regularized  estimates  of  this 
type  Is  well  documented  in  the  literature,  see  for  example  [3]  and  [1]. 
Intuitively  6X  has  been  designed  to  match  the  observed  data  and  possess 
certain  smoothness  qualities.  The  parameter  X  controls  a  tradeoff  between 


the  smoothness  of  a  solution  (measured  by 


PQ  /_%  2 

/  [4,  ip)]  dp)  and  how  well  It 
0  A 


“■ '  2 

[1]  This  corresponds  to  the  case  when  the  measurement  errors  are  lid  N(0,o  ). 

A  more  "robust"  method  would  be  to  consider  functionals  of  the  form 

I,(«)  •  l  pCz.-I  (Tn+4)]  ♦  x  /°C«(*,(pn2dp 
A  i-1  1  M  0  0 

where  o  reflected  the  possible  non-Gausslan  nature  of  the  noise. 


52 


matches  the  data  (the  l  [z.-  $  (Tft+6.)32  term). 

1=1  1  M  0  * 

Inverting  the  R.T.E.'s  with  noisy  data  can  be  viewed  as  a  special  case 
of  a  more  general  situation  In  which  the  scientist  wishes  to  estimate  a 
function  x  given  data 


zi  *  Mjfx)  +  e^  1  *  l,...,n  (1.6) 

where  x  Is  In  some  Hilbert  space  H,  the  H^'s  are  non-linear  functionals 
and  e^'s  are  noise.  Here,  assuming  the  e^'s  are  lid  N(0,o2),  an 
appropriate  regularization  function  1^  Is 

n  , 

I  (x)  =  l  [z.-N.(x)]  +  XJ(x)  (1.7) 

*  1-1  1  1 

where  J  Is  a  roughness  penalty  functional  on  H.  To  estimate  x  one  proceeds 
to  minimize  1^  over  some  subset  of  physical  Interest  In  H.  This  report 
summarizes  recent  results  we  have  obtained  on  the  existence  and  numerical 
approxlmablllty  ^  nifnlmlzers  of  such  I^'s  In  certain  subsets  of  H.  He 
Indicate  how  these  results  apply  to  the  radiative  transfer  equations  case. 

There  are  three  sections:  section  2  talks  about  the  existence  theory; 
a  Gauss-Newton  algorithm  for  minimizing  the  regularization  functionals  is 
outlined  in  section  3,  while  the  final  section  briefly  describes  how  to 
estimate  the  smoothing  parameter  using  a  first  order  approximation  to  toe 
generalized  cross  validation  function  given  In  [8].  He  assume  the  reader 
Is  familiar  with  toe  basic  mathematical  tools  for  discussing  minimization 
problems  In  Hilbert  spaces.  Part  1  of  Ekeland  and  Temam's  book  [2]  Is  an 
Inspiring  Introduction  to  this  subject. 
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2.  Existence  Theory 
Prel  Imlnarles 

Before  describing  our  main  results,  let's  pause  a  moment  to  get  our 
notation  straight.  H  is  a  real  Hilbert  space  with  Inner  product  <*,*>  and 
norm  ||*||  (so  <x,x>-| |x| |2).  P  Is. a  projection  operator  In  H  with  finite 
dlmentlonal  null  space;  the  complementary  projection  I-P  Is  denoted  by  PQ* 
H*  is  the  dual  space  of  H,  l.e.  the  space  of  all  continuous  linear  maps 
from  H  Into  R.  L(H,H*)  Is  the  space  of  linear  operators  from  H  Into  H*. 

We  will  discuss  functionals,  I  say,  acting  on  H  (so  I:  H+R).  The  first  and 
second  Frechet  derivatives  of  I  at  some  point  xeH  will  be  denoted  by  I'(x) 
and  I"(x)  respectively.  Think  of  I‘(x)  as  an  element  of  H*  and  I"(x)  as 
an  element  of  L(H,H*) .  Our  concern  here  Is  with  regularization  functionals 
Ix  on  H  given  by 

I.(x)  -  ?  [zrN<(x)32  +  XllPxll2  (2.1) 

X  1-1  1  1 

where  M^'s  are  functionals  on  H,  z^'s  are  InR,  xeH  and  X>0.  Whenever  we 
write  the  form  (2.1)  will  be  what  Is  meant.  So  we  are  considering  regu¬ 
larization  procedures  In  which  the  roughness  penalty  J(x)  Is  a  semi-norm  on 
H  given  by  J(x)  •  ||Px||2. 

Win  Results 

We  now  specify  conditions  on  the  non-linear  functionals  Nj  which 
guarantee  the  existence  of  mlnlmlzers  of  Ix  In  closed  convex  subsets  K  of 
H.  In  the  R.T.E.  case  a  reasonable  choice  for  K  Is  the  set  of  all  func¬ 
tions  In  W2"[0,p0]  fbr  which  Is  positive.  It  Is  very  easy  to  check 
that  this  K  Is  a  closed  convex  subset  of  W2*[0,p0i  for  any  m.  Our 
existence  results  are  summer) zed  in  the  following  three  theorems. 
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Theorem  1  (proof  In  [2]  pp.  34-35). 

Let  K  be  a  closed  convex  subset  of  a  Hilbert  space  H.  Suppose  1^:  K+R  Is 
coercive  on  K  (l.e.  I.  (x)+-  as  I  lx  I  It-  In  K)  and  moreover  that  I  Is  weakly 

A  A 

lower  semi-continuous  (w.I.s.c.)  on  K  then  1^  attains  Its  Inflmum  on  K. 


Theorem  2  (proof  In  [4]) 

Let  j:  R+R  be  a  monotonic  Increasing  function  In  the  modulus  of  Its  argu¬ 
ment.  Suppose 
n 


(1)  I  ♦(N.(x))  Is  convex  on 
1-1  1 


(11)  l  4>[n.(x)3  *  ♦<*>  P  x  -  P  e  for  some  0 
1-1  1  oo 


then  Ix  Is  coercive  on  K. 


Remark:  The  above  theorem  can  be  generalized  somewhat  but  we  refrain  from 
doing  so  because  the  form  given  has  more  Intuitive  appeal. 


Theorem  3 

If  Nj  Is  weakly  continuous  (w.c.)  on  K  for  each  1  then  Is  w.I.s.c.  on  K. 

n  , 

Proof:  If  the  N,  are  w.c.,  then  It  surely  follows  that  f  [z,-N,(x)]  Is 
i  1-1  1  1 

2 

w.c.  But  llPxll  Is  well  known  to  be  w.I.s.c.  Therefore  1^  Is 
w.I.s.c.  QED 


■f  f  if  A. 


55 


Application  to  the  R.T.E.'s  (see  [4]  for  details) 

The  arising  here  can  be  shown  to  satisfy  the  hypotheses  of  Theorem  2 
with  4  taken  to  be 

*(x)  »  lx|,  xeR 

Also,  each4y  is  w.c.  We  therefore  have  that  for  each 

X>0,3fixeK  =  {ScW^O.p^  Y6>0},  S.t. 

p0 

1.(6.)  -  min  [l  [z.-i  (Tn+«)]2  +  \[  [6(m)(P)]2dp} 

X  X  6cK  i-1  1  M  U  0 

There  exist  regularized  solutions  to  the  R.T.E.'s. 

3.  A  numerical  procedure  for  minimizing  Ix  in  K 


L  4k 

Let  x*  be  the  k  approximation  to  the  minimizer  in  K  of  lx>  Define 

k 

the  functional  Ix  on  K  as  follows 


I.K(x)  -  l  [zl-M,(xk)-N;(xlc)[x-xk]]2  +  xllPxll 
x  i»l  1  1  1 


(3.1) 


k  k+1 

each  Nj  is  simply  linearized  about  x  .  Define  x  to  be  the  minimizer  In 
K  of  Ixk. 

k 

Under  suitable  regularity  conditions  the  Iterates  x  are  well  defined 
and  can  be  shown  to  satisfy 


xk+1  -  xk  -  {  t  Ni(xk)Wi(xk)  +  x<P.,.>}_1i;  (xk) 
1-1  1  1 


i  xk  -  A'1(xk)r(xk) 


(3.2) 
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That  this  equation  makes  good  sense  Is  evident  once  one  realizes  that  A(x  ) 

belongs  to  L(H,H*)  and  ^'(x*)  is  In  H*. 

Those  In  the  know  will  have  recognized  that  the  above  procedure  Is 

nothing  more  than  an  Infinite  dimensional  version  of  the  Gauss-Newton 

algorithm.  The  finite  dimentlonal  case  Is  discussed  In  [5].  The  major 

advantage  of  using  a  Gauss-Newton  procedure  to  minimize  our  regularization 

functionals  Is  the  ease  with  which  successive  Interates  can  be  obtained. 

At  each  stage  we  have  a  regularization  problem  involving  linear  func- 
(c 

tlonals,  the  N{'(x  )'s,  consequently  we  can  take  advantage  of  available 
software  tools. 

With  the  appropriate  assumptions  It  is  possible  to  show  that  the  proce- 

k 

dure  Is  a  decent  method  and  the  sequence  x  converges  at  least  R-llnearly 
to  a  critical  point  of  Ix  In  K. 

Theorem  4  (proof  In  [4]). 

Suppose  that  the  N^M's  are  twice  continuously  differentiable  and 
N{  ( .)• s  are  w.c.  on  Int  K.  Let  x°c  Int  K  be  such  that 

L°  •  {x|lx(x}  <  Ix(x0)} 

Is  weakly  compact  and  Ix  has  only  finitely  many  critical  points  In  1°  . 
Moreover,  suppose  that  Uq»v^,y^  all  positive  with  Ug-l^r^O  satisfying 

M0llh||2  <  <h,A(x)h>  <  u-,  llhll2,  lx"(x)hh  <  r,  I  lh|  |2  YxcL°,  hcH 

k  k  * 

then  the  sequence  of  Iterates  {x  }Q  L°,  11*  x  *  x*  where  l.'(x*)  s  0  and 
*  k 

If  Ix*(x  )  non-singular,  then  the  convergence  Is  at  least  R-l inear. 

The  proof  follows  an  argument  similar  to  that  used  In  14.4.6  of  [5]. 


57 


4.  The  choice  of  X 

The  generalized  cross  validation  method  for  choosing  x  works  as  follows. 
Let  be  the  minimized  In  K  of 

I  Cz.-M.(x)]2  +  xHPxll2  (4.1) 

1-1  1  1 
14k 

Then  x  Is  chosen  to  minimize 


v(x) 


where  N^fx^)  is  the  prediction  of  zfc  given  the  data  z]  ,z2 . z|{_1 

z  ■ , . .  •  .z_  and  a*(x)  Is  the  "differential  Influence"  of  the  z.'th  data 

K+ 1  fl  KK  K 

point  on  the  estimate  xx  (x^  Is  the  minimi zer  In  K  of  I^). 


V11 


(4.3) 


From  a  computational  viewpoint  V(x)  Is  prohibitively  expensive  so  one  needs 
to  find  some  convenient  approximation.  Following  Wahba  [8],  V(x)  can  be 
approximated  by 


Vapprox*** 


Cl-u^x))2 


(4.4) 


C21  Assumed  to  be  uniquely  defined 
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i 

.  i 


1 

J 


where  wi  given  by 


,  n  3N,,(x.) 


is  an  easily  computed  functional  of  x^.  We  hope  to  study  this  procedure 
more  closely  in  the  near  future. 
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